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Computational Complexity of Decoding Orthogonal Space-Time Block Codes 

Ender Ayanoglu*, Erik G. Larsson^, and Eleftherios Karipidis^ 

Abstract 

The computational complexity of optimum decoding for an orthogonal space-time block code Qm satisfying 
GnGn = c (S/cLi \sk\ 2 )lN where c is a positive integer is quantified. Four equivalent techniques of optimum 
decoding which have the same computational complexity are specified. Modifications to the basic formulation in 
special cases are calculated and illustrated by means of examples. This paper corrects and extends [1],[2], and 
unifies them with the results from the literature. In addition, a number of results from the literature are extended 
to the case c > 1. 

I. Introduction 

In [3], an optimum Maximum Likelihood metric is introduced for Orthogonal Space-Time Block Codes 
(OSTBCs). A general description of this metric and specific forms for a number of space-time codes can 
be found in [4]. This metric is complicated and, in a straightforward implementation, its computational 
complexity would depend on the size of the signal constellation. By a close inspection, it can be observed 
that it can actually be simplified and made independent of the constellation size. Alternatively, the 
Maximum Likelihood formulation can be made differently and the simplified metric can be obtained via 
different formulations [5], [6]. In [1],[2], yet another formulation is provided. In this paper, we will unify 
all of the approaches cited above and calculate the computational complexity of the optimum decoding 
of an OSTBC. We will begin our discussion within the framework of [1],[2]. 

Consider the decoding of an OSTBC with N transmit and M receive antennas, and an interval of T 
symbols during which the channel is constant. The received signal is given by 

Y = G N H + V (1) 

where Y = [y^rxM is the received signal matrix of size T x M and whose entry y{ is the signal 
received at antenna j at time t, t = 1,2, . . . ,T, j = 1,2 . . . , M; V = \vI\txm is the noise matrix, and 
= [giWxN is the transmitted signal matrix whose entry g\ is the signal transmitted at antenna i at time 
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t, % — 1, 2, . . . , N. The matrix H = [hij] NxM is the channel coefficient matrix of size N x M whose entry 
hi j is the channel coefficient from transmit antenna i to receive antenna j. The entries of the matrices H 
and V are independent, zero-mean, and circularly symmetric complex Gaussian random variables. Gn is 
an OSTBC with complex symbols s k , k = 1, 2, . . . , K and therefore G^Gn = c(J2k=i \ s k\ 2 )lN where c 
is a positive integer and I N is the identity matrix of size N. 

II. A Real- Valued Representation 

Arrange the matrices Y, H, and V, each in one column vector by stacking their columns on top of 
one another 

y = vec(Y) = (yl,...,y¥) T , (2) 
h = vec(H) = {h lfl , . . . , h NtM ) T , (3) 
v = vec(V) = (v\,...,v¥) T - ( 4 ) 



Then one can write 



y = Gnh + v (5) 



where Gn = Im ® Gn, with ® denoting the Kronecker matrix multiplication. In [1],[2], a real-valued 
representation of CD is obtained by decomposing the MT-dimensional complex problem defined by © 
to a 2MT-dimensional real-valued problem by applying the real-valued lattice representation defined in 
[7] to obtain 

y = Hx + v (6) 

where 

y = (Re(yl),Im(yl),...,Re(y¥),My¥)) T , (V) 
x = (Re(si),Im(si), . . . ,Re(s x ),Im(sx)) T , (8) 
v = (Re(w 1 1 ),Im(w 1 1 ),...,Re(4 f ),Im(4 / )) T - (9) 

The real-valued fading coefficients of H are defined using the complex fading coefficients hij from 
transmit antenna i to receive antenna j as /i2i-i+2(i-i)jv = Re(/iij) and /i2i+2(j-i)jv = I m (^i,j) for 
i = 1,2, ... ,N and j = 1, 2, ... , M. Since Gn is an orthogonal matrix and due to the real- valued 
representation of the system using ©, it can be observed that the columns hi of H are orthogonal to 
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each other and their inner products with themselves are a constant [1],[2] 

H T H = al 2K . (10) 
By multiplying © by H T on the left, we have 

y — ax + v (11) 



where y = H T y, and v = H T y is a zero-mean random vector. Due to (flOl) . v has independent and 
identically distributed Gaussian members. The Maximum Likelihood solution is found by minimizing 

Hy-eraclla (12) 

or equivalently 

h^y-xWl (13) 

over all combinations of x E Vt 2K . As a result, the joint detection problem of an OSTBC decouples into 
K symbol detection problems 



(jfyk-lilhk) - (^2fc-l,^2fe)|| 2 ( 14 ) 



one per symbol (x 2 k-i, x 2 k) £ where k = 1, 2, . . . , K. Further, we assume that the signal constellation 
is separable as Vt 2 where Vt = {±1, ±3 . . . , ±(2L — 1)}, and L is an integer, the Maximum Likelihood 
decoding problem can be further simplified to 

mm\x k -x k \ 2 (15) 

where we denoted 

x k = a~%, k = l,2,...,2K, (16) 

which is a standard operation in conventional Quadrature Amplitude Modulation (QAM). In the sequel, 
we will compute the decoding complexity up to this quantization operation. 
The decoding operation consists of the multiplication 



V = H T y, 



(17) 
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the calculation of 

a = hlh x , (18) 

the inversion of a, and the multiplications in (TT61 ). 

In what follows, we will show that when G^Gn = c (^2k=i \ s k\ 2 )lN where c is a positive integer, then 
a = c\\H\\ 2 . The development will lead to the four equivalent optimal decoding techniques discussed in 
the next section. 

Let Sk = Re[sfc] and 5k = lm[sk\. Form two vectors, s and s, consisting of Sk and §k, respectively, and 
form a vector s' that is the concatenation of s and s 

s = (s 1 ,s 2 ,...,s K ) T , s = (si, s 2 , ■ ■ ■ ,s K ) T , s' = (s t ,s t ) t . (19) 
By rearranging the right hand side of ©, we can write 

y = Fs' + v = F a s + F b s + v (20) 

where F = [F a Fb] is an MT x 2K complex matrix and F a and F b are MT x K complex matrices 
whose entries consist of (linear combinations of) channel coefficients h iy j. In [5], it was shown that when 
G§Gn = {J2k=i \sk\ 2 )lN, then Re[F H F] = \\H\\ 2 I N . It is straightforward to extend this result so that 
when GnGn = c(52 k =i \ s k\ 2 )lN, then 

Re[F H F] = c\\H\\ 2 I (21) 

where c is a positive integer. Let 

y = Re[y], y = lm[y], v = Re[v], v = Tm[v], (22) 

and 

F = Re[F a ], F = Im[F a ], F b = Re[F b ], F b = lm[F b ]. (23) 

Now define 



y 


F' = 


K 


F b 


v' = 


V 


. y . 






F b _ 




V 



(24) 



so that we can write 

y ' = F's' + v (25) 
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which is actually the same expression as © except the vectors and matrices have their rows and columns 
permuted. 

It can be shown that (I2TT) implies 

F' T F' = c\\H\\ 2 L (26) 
Let P y and P s be 2MT x 2MT and 2K x 2K, respectively, permutation matrices such that 

y = P y y', x = P s s'. (27) 
It follows that pTPy = PyPT = I and PjP s = P s Pj = I. We now have 

y = P y (F's' + v') = PyF'Pjx + P y v =Hx + v. (28) 



Therefore, 



which implies 



H = PyF'Pj (29) 



H T H = P s F' T pTp y F'pT = c\\H\\ 2 L (30) 



As a result, a = c\\H\\ 2 . 

III. Four Equivalent Optimum Decoding Techniques for OSTBCs 

For an OSTBC Q N satisfying G^Gn = c (5^fc=i l s fc|| 2 )-^Af where c is a positive integer, the Maximum 
Likelihood solution is formulated in four equivalent ways with equal squared distance values 

\\Y - GnH\\ 2 = \\y - Fs'\\ 2 = \\y' - F's'\\ 2 = \\y - Hx\\ 2 . (31) 

There are four solutions, all equal. The first solution is obtained by expanding \\Y — GnH\\ 2 and is given 
by eq. (7.4.2) of [5] when c = Hi When c > 1, it should be altered as 

§ k = -l^iMMH H A* Y)} - i ■ Im{Tr{H H B*Y)}) k = l,2,...,K (32) 
c ll-" II 

where A k and B k are the matrices in the linear representation of Gn m terms of s k and s k for k = 



1,2,..., If as 



K K 



Gn = ^ s k A k + is k B k = s k A k + s* k B k , (33) 

k=l k=l 
'The notation in [4] and [5] is the transposed form of the one adopted in this paper. 



i = A k = A k + B k , and B k = A k — B k [5]. Once {s fc }^i are calculated, the decoding problem 

can be solved by 

min \s k - Re[s fc ]| 2 , min \s k - lm[s k ]\ 2 (34) 

once for each k = 1, 2, . . . , K. Similarly to (fT5l) . this is a standard quantization problem in QAM. 
The second solution is obtained by expanding the second expression in (I3TI) and is given by 

MF H y] 



c\\H\ 



(35) 



This is given in [4. eq. (7.4.20)] for c = 1. The third solution corresponds to the minimization of the third 
expression in (I3TI) and is given by 

- F ' T y' ™ 

s = 4W (36) 

The fourth solution is the one introduced in [1]. It is obtained by minimizing the fourth expression in 
(I3TI) and is given by 

a c\\H\\ 

Considering that 

F a = [vec(A x i?) • ■ ■ vec(A K H)) F b = [i ■ vec( J B 1 if) ■■■%■ vec{B K H)) (38) 



[4, eq. (7.1.7)], it can be verified that (1321) and (1351) are equal. The equality of (1351) and (1361) follows from 
(|22l) -(|24 ]) . The equality of ([36]) and ([37]) follows from d27J and Therefore, equations ([32]), (|35])-([37T) 
yield the same result, and when properly implemented, will have identical computational complexity. 

Although these four techniques are equivalent, a straightforward implementation of (l32l) or ([35]) can 
actually result in larger complexity than (|36"1) or ([37]) . The proper implementation requires that in ((32l) or 
(l35l) . the terms not needed due to elimination by the Tr[ ], Re[ ], and Im[ ] operators are not calculated. 

Let's now compare these techniques with the minimization of the metric introduced in [3]. For a complex 
OSTBC, let [3], [4] 

M 

r k = ^2 ^2sgn t (k)h e{k)d g(k) (39) 

terj(fc) j=l 

where r]{k) is the set of rows of Qn in which s k appears, e t (k) expresses the column position of s k in 



the tth row, sgn t (/c) denotes the sign of s k in the tth row, 



h* t / k sj if s k is in the tth row of Qn, 

l et{k),j = < ' (40) 



h e , k -<j if si is in the tth row of Q 



N 



and 



y{ if s k is in the tth row of Q 



N, 



m) = { . ; (41) 

{ylY if s* k is in the tth row of Qn 

for k = 1, 2, . . . , K. A close inspection shows that r k in (f39l)-(f4Tb is equal to the numerator of (1321) . 
The metric to be minimized for s k is given as [3], [4] 

tJV M \ 
c^^l^f-l |s fc | 2 . (42) 
i=l 3=1 / 



Implemented as it appears in (|42l) . this metric has larger complexity than the metrics for four equivalent 
techniques described above. Furthermore, its complexity depends on the constellation size L due to the 
presence of the factor |sfc| 2 . It can be simplified, however. 
For minimization purposes, we can write (l42l) as 



|s fc | 2 - 2Re[s* k r k ] + \r k \ 2 + c||iJ|| 2 |s fc | 2 - \s k \ 



1 2 / i ,2 2Re[s£r fc ] , |r fc | 2 



° m 1 lSkl ~ w + Mr 1 + const (43) 



c\\H\ 



s k ~ [TT7T 
c\\H\ 



2 



+ const. 



where the first equality follows from the fact that the third term inside the paranthesis in (1431) is independent 
of s k . Because of our observation that r k is the same as the numerator of (|32|) . we have 



s^^lkb- k=l,2,...,K (44) 
c ll-n II 

and then this method becomes equivalent to our four equivalent techniques. 

IV. Optimum Decoding Complexity of OSTBCs 

Since the four decoding techniques (l32l) . (|35T)-(|37T) are equivalent, we will calculate their computational 
complexity by using one of them. This can be done most simply by using (|36l) or (1371) . We will use (|37l) 
for this purpose. 

First, assume c = 1. Note H is a 2MT x 2K matrix. The multiplication H T y takes 2MT ■ 2K and 
calculation of a = \\H\\ 2 takes 2MN real multiplications, its inverse takes a real division, and o~ l y takes 
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2K real multiplications. Similarly, the multiplication H T y takes 2K ■ (2MT — 1), and calculation of a 
takes 2MN — 1 real additions. Letting R D , R M and R A be the number of real divisions, the number of 
real multiplications, and the number of real additions, the complexity of decoding the transmitted complex 
signal (si, s 2 , . . . , s K ) with the technique described in (fT71),(fT8l),(fT6l) is 

C = IR D , (AKMT + 2MN + 2K)R M , (AKMT + 2MN -2K- 1)R A . (45) 

Note that the complexity does not depend on the constellation size L. If we take the complexity of a real 
division as equivalent to 4 real multiplications as in [1],[2], then the complexity is 

C = (AKMT + 2MN + 2K + A)R M , (AKMT + 2MN -2K -l)R A (46) 

which is smaller than the complexity specified in [1],[2] and does not depend on L. In the rest of this 
paper, we will use this assumption. The conversion from this form to that in (l45l) can be made simply by 
adding a real division and reducing the number of real multiplications by 4. 

When c > 1, the number of real multiplications to calculate a increases by 1, however, in the examples 
it will be seen that the complexity of the calculation of H T y is reduced by a factor of c. 

In what follows, we will calculate the exact complexity values for four examples. See [3], [4] for explicit 
metrics of the form (|391)-(|42|) for these examples. 

Example 1: Consider the Alamouti OSTBC with N = K = T = 2andM=l where 



The matrix H can be calculated as 

H 

Note that the matrix H is orthogonal and all of its columns have the same squared norm. One needs 16 real 
multiplications to calculate y = H T y, 4 real multiplications to calculate a = hjhi, 4 real multiplications 
to calculate a -1 , and 4 real multiplications to calculate <r~ l y. There are 3 ■ 4 = 12 real additions to 
calculate H T y and 3 real additions to calculate a. As a result, with this approach, decoding takes a total 



si s 2 

~ S 2 S l 



hi 


-h 2 


h 3 


—h^ 


h 2 


hi 


hi 


h 3 


h 3 


hi 


-h 


-h 2 


h{ 


-h 


-h 2 


hi 



(47) 



(48) 
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of 28 real multiplications and 15 real additions. 

The complexity figures in (|46l) are 28 real multiplications and 15 real additions, which hold exactly. 
Example 2: Consider the OSTBC with M = 2, N = 3, T = 8, and K = 4 given by [8] 

T 



51 -s 2 

5 2 S 1 

53 Si Si 



S 3 



Si ST 



S4 —S3 s 2 s 1 



s 4 s 3 



S2 



s 3 s 4 



(49) 



For this Q N , one has G3G3 = 2 fX/fLi |sfc| 2 j I3. In [2], it has been shown that the 32 x 8 real-valued 
channel matrix H is 



H 





-h 2 


h 3 


—h/i 


h 5 


-h e 








ho 


hi 


hi 


h 3 


h 6 


h 5 








h 7 


-h 8 


hg 


-ho 


hn 


-hu 








hs 


h 7 


ho 


hg 


hn 


hn 














hn 


hv2 


-hg 


-hw 


-h 7 


-h 








hi2 


-hn 


-h w 


hg 


-h 8 


h 7 



(50) 



where hi, i = 1,2, ... ,11 and hj, j = 2, 4, . . . , 12 are the real and imaginary parts, respectively, of h^i, 
h2,\, ^3,1, h\^, h2,2, ^3,2- The matrix H T is 8 x 32 where each row has 8 zeros, while each of the remaining 
24 symbols has one of hi, h 2 , ■ ■ ■ , h i2 , repeated twice. Let's first ignore the repetition of hi in a row. Then, 
the calculation of H T y takes 8 • 24 = 192 real multiplications. The calculation of a = hjhi = 2 ^2 k=1 hj 
takes 12 + 1 = 13 real multiplications, In addition, one needs 4 real multiplications to calculate a~ l , and 
8 real multiplications to calculate <J~ l y. To calculate H T y, one needs 8 • 23 = 184 real additions, and to 
calculate a, one needs 11 real additions. As a result, with this approach, one needs a total of 217 real 
multiplications and 195 real additions to decode. 

For this example, (|46|) specifies 300 real multiplications and 279 real additions. The reduction is due 
to the elements with zero values in H. 

It is important to make the observation that the repeated values of hi in the columns of H, or equivalently 
h* m n in the rows of H H A^ or H H , have a substantial impact on complexity. Due to the repetition of 
hi, by grouping the two values of yj that it multiplies, it takes 8 • 12 = 96 real multiplications to compute 
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H T y, not 8-24 = 192. The summations for each row of H T y will now be carried out in two steps, first 12 
pairs of additions per each hi, and then after multiplication by hi, addition of 12 real numbers. This takes 
12 + 11 = 23 real additions, with no change from the way the calculation was made without grouping. 
With this change, the complexity of decoding becomes 121 real multiplications and 195 real additions, a 
huge reduction from 300 real multiplications and 279 real additions. 

Example 3: We will now consider the code Q4 from [8]. The parameters for this code are N = K = 4, 
M = 1, and T = 8. It is given as 

T 



51 — s 2 —s 3 — S4 s 1 — s 2 —s 3 — s 4 

5 2 Si S 4 —S3 S 2 s* 



s 4 s 3 



S3 



-S4 Si 



S-2 



S4 S3 



S 2 Si 



(51) 



Similarly to Qs of Example 2, this code has the property that G4G4 = 2(52 k=i \sk\ 2 )h- The H matrix is 
16 x 8 and can be calculated as 



H 



h 


-h 2 


h 


—hi 


h 5 


-h 6 


li- 


hs 


h 2 


hi 




hs 


h 6 


h 


tis 


h 7 


h 


—h^ 


-h 


h 2 


h 7 


-h 8 


-h 5 


he 


fii 


h 3 


-h 2 


-h 


hs 


h 7 


-he 


-h 5 


h 5 


he, 


-h 7 


hs 


-h 


-h 2 


h 3 


hi 


h 6 


-h 5 


-h & 


hi 


-h 2 


hi 


hi 


-hs 



(52) 



This matrix consists entirely of nonzero entries. Each entry in a column equals ±hi for some i £ 
{1,2, ...,8}, every hi appearing twice in a column. Ignoring this repetition for now, calculation of 
H T y takes 8-16 = 128 real multiplications. Calculation of a takes 9 real multiplications, its inverse 
4 real multiplications, and the calculation of o~ l y takes 8 real multiplications. Calculation of H T y takes 
8 • 15 = 120 real additions, and calculation of a takes 7 real additions. As a result, with this approach, to 
decode, one needs 149 real multiplications and 127 real additions. 

For this example, equation (|46l ) specifies 156 real multiplications and 135 real additions. The reduction is 
due to the fact that one row of H T has each hi appearing twice. This reduces the number of multiplications 
and summations to calculate a by about a factor of 2. 
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However, because each hi appears twice in every row of H T , the number of multiplications can actually 
be reduced substantially. As discussed in Example 2, we can reduce the number of multiplications to 
calculate H T y by grouping the two multipliers of each hi by summing them prior to multiplication by hi, 
i = 1, 2, . . . , 8. As seen in Example 2, this does not alter the number of real additions. With this simple 
change, the number of real multiplications to decode becomes 85 and the number of real additions to 
decode remains at 127. 

Example 4: It is instructive to consider the code H 3 given in [8] with iV = 3, K = 3, T = A which 
we will consider for M = 1 where 



H 3 



si s 2 s 3 /V2 

s 2 s i s 3 /V2 
sl/V2 sl/V2 - s\ + s 2 - s*)/2 

s* 3 /V2 s* 3 /V2 (s 2 + s* + Sl -s{)/2 

For this code, H^Hz = (J2k=i l s fc| 2 )^3 * s satisfied. In this case, the matrix H can be calculated as 



(53) 



H 



hi 


—hi 


h 3 


—hi 


h 5 /y/2 


-h 6 /V2 


h 2 


hi 


hi 


h 


h/V2 




h 3 


h^ 


-hi 


-h 2 


h/V2 


-h/V2 


hi 


-h 3 


-h 2 


hi 




h 5 /V2 


-h 5 








-h 6 


{hi + h 3 )/V2 


(h 2 + hi)/V2 


-h 6 








h 5 


(h 2 + hi) /y/2 


-(h + h 3 )/V2 





h 6 


h 5 





(h - h 3 )/V2 


(h 2 - hi)/V2 





-h 5 


he 





(h 2 - hi)/V2 


(-h! + h 3 )/V2 



(54) 



It can be verified that every column hi of H has the property that hjhi = o = \\H\\ 2 = Ylk=i f° r 
% = 1, 2, . . . , 6. In this case, the number of real multiplications to calculate H T y requires more caution 
than the previous examples. For the first four rows of H T , this number is 6 real multiplications per row. 
For the last two rows, due to combining, e.g., hi and h 3 in (hi + h 3 )/\/2 in the fifth element of h 5 , and 
the commonality of h 5 and h 6 for the first and third, and second and fourth, respectively, elements of h 5 , 
and one single multiplier l/y/2 for the whole column, the number of real multiplications needed is 7. As 
a result, calculation of H T y takes 38 real multiplications. Calculation of a takes 6 real multiplications. 
One needs 4 real multiplications to calculate a^ 1 , and 6 real multiplications to calculate a~ l y. First four 
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rows of H T y require 5 real additions each. Last two rows of H T y require 4 + 7 = 11 real additions each. 
This is a total of 42 real additions to calculate H T y. Calculation of a requires 5 real additions. Overall, 
with this approach one needs 54 real multiplications and 47 real additions to decode. 

For this example, (|46l) specifies 66 real multiplications and 49 real additions. The reduction is due to 
the presence of the zero entries in H. On the other hand, the presence of the factor 1 / y/2 in the last two 
rows of H T adds two real multiplications to the total number of real multiplications. 

V. Conclusion 

Equation (1431 ) yields the computational complexity of decoding an OSTBC when its H matrix consists 
only of nonzero entries in the form of hi when c = 1. It should be updated as specified in the paragraph 
following (14"6l) when c > 1. The presence of zero values within H reduces the computational complexity. 
In the examples its effect has been a reduction in the number of real multiplications to calculate H T y by a 
factor equal to the ratio of the rows of and B/. that consist only of zero values to the total number of all 
rows in and Bk for k = 1, 2 . . . , K, with a similar reduction in the number of real additions to calculate 
H T y. With the modifications outlined above, (1431 ) specifies the computational complexity of decoding the 
majority of OSTBCs. In some cases, the contents of the H matrix can have linear combinations of hi 
values, which result in minor changes in computational complexity as specified by this formulation, as 
shown in Example 4. Finally, note that L = 2 is a special case where the signal belongs to one of the 
four quadrants, calculation of and division by c||7J|| 2 are not needed and the computational complexity 
will be correspondingly lower. 
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